Pesquisa | Portal Regional da BVS

1.

Deep Generative Adversarial Reinforcement Learning for Semi-Supervised Segmentation of Low-Contrast and Small Objects in Medical Images.

Xu, Chenchu; Zhang, Tong; Zhang, Dong; Zhang, Dingwen; Han, Junwei.

IEEE Trans Med Imaging ; PP2024 Apr 01.

Artigo em Inglês | MEDLINE | ID: mdl-38557623

RESUMO

Deep reinforcement learning (DRL) has demonstrated impressive performance in medical image segmentation, particularly for low-contrast and small medical objects. However, current DRL-based segmentation methods face limitations due to the optimization of error propagation in two separate stages and the need for a significant amount of labeled data. In this paper, we propose a novel deep generative adversarial reinforcement learning (DGARL) approach that, for the first time, enables end-to-end semi-supervised medical image segmentation in the DRL domain. DGARL ingeniously establishes a pipeline that integrates DRL and generative adversarial networks (GANs) to optimize both detection and segmentation tasks holistically while mutually enhancing each other. Specifically, DGARL introduces two innovative components to facilitate this integration in semi-supervised settings. First, a task-joint GAN with two discriminators links the detection results to the GAN's segmentation performance evaluation, allowing simultaneous joint evaluation and feedback. This ensures that DRL and GAN can be directly optimized based on each other's results. Second, a bidirectional exploration DRL integrates backward exploration and forward exploration to ensure the DRL agent explores the correct direction when forward exploration is disabled due to lack of explicit rewards. This mitigates the issue of unlabeled data being unable to provide rewards and rendering DRL unexplorable. Comprehensive experiments on three generalization datasets, comprising a total of 640 patients, demonstrate that our novel DGARL achieves 85.02% Dice and improves at least 1.91% for brain tumors, achieves 73.18% Dice and improves at least 4.28% for liver tumors, and achieves 70.85% Dice and improves at least 2.73% for pancreas compared to the ten most recent advanced methods, our results attest to the superiority of DGARL. Code is available at GitHub.

2.

Species -shared and -unique gyral peaks on human and macaque brains.

Zhang, Songyao; Zhang, Tuo; Cao, Guannan; Zhou, Jingchao; He, Zhibin; Li, Xiao; Ren, Yudan; Liu, Tao; Jiang, Xi; Guo, Lei; Han, Junwei; Liu, Tianming.

Elife ; 122024 Apr 18.

Artigo em Inglês | MEDLINE | ID: mdl-38635322

RESUMO

Cortical folding is an important feature of primate brains that plays a crucial role in various cognitive and behavioral processes. Extensive research has revealed both similarities and differences in folding morphology and brain function among primates including macaque and human. The folding morphology is the basis of brain function, making cross-species studies on folding morphology important for understanding brain function and species evolution. However, prior studies on cross-species folding morphology mainly focused on partial regions of the cortex instead of the entire brain. Previously, our research defined a whole-brain landmark based on folding morphology: the gyral peak. It was found to exist stably across individuals and ages in both human and macaque brains. Shared and unique gyral peaks in human and macaque are identified in this study, and their similarities and differences in spatial distribution, anatomical morphology, and functional connectivity were also dicussed.

Assuntos

Encéfalo , Macaca , Animais , Humanos

3.

Diverse and asymmetric patterns of single-neuron projectome in regulating interhemispheric connectivity.

Fei, Yao; Wu, Qihang; Zhao, Shijie; Song, Kun; Han, Junwei; Liu, Cirong.

Nat Commun ; 15(1): 3403, 2024 Apr 22.

Artigo em Inglês | MEDLINE | ID: mdl-38649683

RESUMO

The corpus callosum, historically considered primarily for homotopic connections, supports many heterotopic connections, indicating complex interhemispheric connectivity. Understanding this complexity is crucial yet challenging due to diverse cell-specific wiring patterns. Here, we utilized public AAV bulk tracing and single-neuron tracing data to delineate the anatomical connection patterns of mouse brains and conducted wide-field calcium imaging to assess functional connectivity across various brain states in male mice. The single-neuron data uncovered complex and dense interconnected patterns, particularly for interhemispheric-heterotopic connections. We proposed a metric "heterogeneity" to quantify the complexity of the connection patterns. Computational modeling of these patterns suggested that the heterogeneity of upstream projections impacted downstream homotopic functional connectivity. Furthermore, higher heterogeneity observed in interhemispheric-heterotopic projections would cause lower strength but higher stability in functional connectivity than their intrahemispheric counterparts. These findings were corroborated by our wide-field functional imaging data, underscoring the important role of heterotopic-projection heterogeneity in interhemispheric communication.

Assuntos

Corpo Caloso , Neurônios , Animais , Corpo Caloso/fisiologia , Masculino , Camundongos , Neurônios/fisiologia , Vias Neurais/fisiologia , Conectoma , Encéfalo/fisiologia , Simulação por Computador , Modelos Neurológicos , Rede Nervosa/fisiologia , Cálcio/metabolismo

4.

Vertical-Channel Cathode Host Enables Rapid Deposition Kinetics toward High-Areal-Capacity Sodium-Chlorine Batteries.

Ma, Chenyu; Feng, Wenting; Kong, Debin; Wei, Xinru; Gong, Xuelei; Yang, Jianhang; Han, Junwei; Zhi, Linjie.

Small ; : e2310978, 2024 Mar 21.

Artigo em Inglês | MEDLINE | ID: mdl-38513253

RESUMO

Rechargeable sodium chloride (Na-Cl2) batteries have emerged as promising alternatives for next-generation energy storage due to their superior energy density and sodium abundance. However, their practical applications are hindered by the sluggish chlorine cathode kinetics related to the aggregation of NaCl and its difficult transformation into Cl2. Herein, the study, for the first time from the perspective of electrode level in Na-Cl2 batteries, proposes a free-standing carbon cathode host with customized vertical channels to facilitate the SOCl2 transport and regulate the NaCl deposition. Accordingly, electrode kinetics are significantly enhanced, and the deposited NaCl is distributed evenly across the whole electrode, avoiding the blockage of pores in the carbon host, and facilitating its oxidation to Cl2. With this low-polarization cathode, the Na-Cl2 batteries can deliver a practically high areal capacity approaching 4 mAh cm-2 and a long cycle life of over 170 cycles. This work demonstrates the significance of pore engineering in electrodes for mediating chlorine conversion kinetics in rechargeable alkali-metal-Cl2 batteries.

5.

Resource recovery and regeneration strategies for spent lithium-ion batteries: Toward sustainable high-value cathode materials.

Gu, Kunhong; Tokoro, Chiharu; Takaya, Yutaro; Zhou, Jiang; Qin, Wenqing; Han, Junwei.

Waste Manag ; 179: 120-129, 2024 Apr 30.

Artigo em Inglês | MEDLINE | ID: mdl-38471250

RESUMO

Traditional cathode recycling methods have become outdated amid growing concerns for high-value output and environmental friendliness in spent Li-ion battery (LIB) recycling. Our study presents a closed-loop approach that involves selective sulfurization roasting, water leaching, and regeneration, efficiently transforming spent ternary Li batteries (i.e., NCM) into high-performance cathode materials. By combining experimental investigations with density functional theory (DFT) calculations, we elucidate the mechanisms within the NCM-C-S roasting system, providing a theoretical foundation for selective sulfidation. Utilizing in situ X-ray diffraction techniques and a series of consecutive experiments, the study meticulously tracks the evolution of regenerating cathode materials that use transition metal sulfides as their primary raw materials. The Li-rich regenerated NCM exhibits exceptional electrochemical performance, including long-term cycling, high-rate capabilities, reversibility, and stability. The closed-loop approach highlights the sustainability and environmental friendliness of this recycling process, with potential applications in other cathode materials, such as LiCoO2 and LiMn2O4. Compared with traditional methods, this short process approach avoids the complexity of leaching, solvent extraction, and reverse extraction, significantly increasing metal utilization and Li recovery rates while reducing pollution and resource waste.

Assuntos

Lítio , Metais , Fontes de Energia Elétrica , Eletrodos , Reciclagem , Íons

6.

Task sub-type states decoding via group deep bidirectional recurrent neural network.

Zhao, Shijie; Fang, Long; Yang, Yang; Tang, Guochang; Luo, Guoxin; Han, Junwei; Liu, Tianming; Hu, Xintao.

Med Image Anal ; 94: 103136, 2024 May.

Artigo em Inglês | MEDLINE | ID: mdl-38489895

RESUMO

Decoding brain states under different cognitive tasks from functional magnetic resonance imaging (fMRI) data has attracted great attention in the neuroimaging filed. However, the well-known temporal dependency in fMRI sequences has not been fully exploited in existing studies, due to the limited temporal-modeling capacity of the backbone machine learning algorithms and rigid training sample organization strategies upon which the brain decoding methods are built. To address these limitations, we propose a novel method for fine-grain brain state decoding, namely, group deep bidirectional recurrent neural network (Group-DBRNN) model. We first propose a training sample organization strategy that consists of a group-task sample generation module and a multiple-scale random fragment strategy (MRFS) module to collect training samples that contain rich task-relevant brain activity contrast (i.e., the comparison of neural activity patterns between different tasks) and maintain the temporal dependency. We then develop a novel decoding model by replacing the unidirectional RNNs that are widely used in existing brain state decoding studies with bidirectional stacked RNNs to better capture the temporal dependency, and by introducing a multi-task interaction layer (MTIL) module to effectively model the task-relevant brain activity contrast. Our experimental results on the Human Connectome Project task fMRI dataset (7 tasks consisting of 23 task sub-type states) show that the proposed model achieves an average decoding accuracy of 94.7% over the 23 fine-grain sub-type states. Meanwhile, our extensive interpretations of the intermediate features learned in the proposed model via visualizations and quantitative assessments of their discriminability and inter-subject alignment evidence that the proposed model can effectively capture the temporal dependency and task-relevant contrast.

Assuntos

Encéfalo , Conectoma , Humanos , Encéfalo/diagnóstico por imagem , Redes Neurais de Computação , Conectoma/métodos , Algoritmos , Imageamento por Ressonância Magnética/métodos

7.

An Inductive Reasoning Model based on Interpretable Logical Rules over temporal knowledge graph.

Mei, Xin; Yang, Libin; Jiang, Zuowei; Cai, Xiaoyan; Gao, Dehong; Han, Junwei; Pan, Shirui.

Neural Netw ; 174: 106219, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38442489

RESUMO

Extrapolating future events based on historical information in temporal knowledge graphs (TKGs) holds significant research value and practical applications. In this field, the methods currently utilized can be classified as either embedding-based or logical rule-based. Embedding-based methods depend on learned entity and relation embeddings for prediction, but they suffer from the lack of interpretability due to the opaque reasoning process. On the other hand, logical rule-based methods face scalability challenges as they heavily rely on predefined logical rules. To overcome these limitations, we propose a hybrid model that combines embedding-based and logical rule-based methods to capture deep causal logic. Our model, called the Inductive Reasoning Model based on Interpretable Logical Rule (ILR-IR), aims to provide interpretable insights while effectively predicting future events in TKGs. ILR-IR delves into historical information, extracting valuable insights from logical rules embedded within relations and interaction preferences between entities. By considering both logical rules and interaction preferences, ILR-IR offers a comprehensive perspective for predicting future events. In addition, we propose the incorporation of a one-class augmented matching loss during optimization, which serves to enhance performance of the model during training. We evaluate ILR-IR on multiple datasets, including ICEWS14, ICEWS0515, and ICEWS18. Experimental results demonstrate that ILR-IR outperforms state-of-the-art baselines, showcasing its superior performance in TKG extrapolation reasoning. Moreover, ILR-IR demonstrates remarkable generalization capabilities, even when applied to related datasets that share a common relation vocabulary. This suggests that our proposed model exhibits robust zero-shot reasoning abilities. For interested parties, we have made our code publicly available at https://github.com/mxadorable/ILR-IR.

Assuntos

Reconhecimento Automatizado de Padrão , Resolução de Problemas , Aprendizagem , Generalização Psicológica , Conhecimento

8.

Innovative methodology for comprehensive utilization of arsenic-bearing neutralization sludge.

Zhang, Tianfu; Han, Junwei; Dong, Liuyang; Liu, Dianwen; Jiao, Fen; Qin, Wenqing; Liu, Wei.

J Environ Manage ; 353: 120148, 2024 Feb 27.

Artigo em Inglês | MEDLINE | ID: mdl-38306856

RESUMO

Arsenic-bearing neutralization (ABN) sludge is a classical hazardous waste commonly found in nonferrous metallurgy. However, the current storage of these hazardous wastes not only has to pay costly hazardous waste taxes but also poses significant risks to both the environment and human health. To address these issues and achieve the comprehensive utilization and minimization of ABN sludge, this study proposes a new combined process. The process involves selective reduction roasting, leaching, and carbonation, through which, the arsenate and gypsum in the ABN sludge were recovered in the form of As(s), high-purity CaCO3, and H2S. The selective reduction behaviors of arsenate and gypsum were investigated through thermodynamic analysis and roasting experiments. The results indicated that the 95.35 % arsenate and 96.55 % gypsum in the sludge were selectively reduced to As4(g) and CaS at 950 °C by carbothermic reduction. The As4(g) was condensed to As(s) and enriched in the dust (As, 96.78 wt %). In the leaching process, H2S gas was adopted to promote the leaching of CaS, and resulted in 97.41 % of CaS in the roasted product was selectively leached in the form of Ca(HS)2, leading to a 74.11 % reduction in the weight of the ABN sludge. Then, the Ca(HS)2 was subjected to capture CO2 for the separation of Ca2+ and S2-. The result depicted that 99.69 % of Ca2+ and 99.12 % of S2- were separated as high-purity (99.12 wt %) CaCO3 and H2S (24.89 vol %) by controlling the terminal carbonation pH to below 6.55. The generated H2S can be economically converted to sulfur by the Clause process. The whole process realized the comprehensive resource recovery and the minimization of the sludge, which provides an alternative solution for the clean treatment of hazardous ABN waste.

Assuntos

Arsênio , Humanos , Arsênio/análise , Esgotos , Arseniatos , Sulfato de Cálcio , Resíduos Perigosos

9.

Uncertainty Modeling for Gaze Estimation.

Zhong, Wenqi; Xia, Chen; Zhang, Dingwen; Han, Junwei.

IEEE Trans Image Process ; 33: 2851-2866, 2024.

Artigo em Inglês | MEDLINE | ID: mdl-38358877

RESUMO

Gaze estimation is an important fundamental task in computer vision and medical research. Existing works have explored various effective paradigms and modules for precisely predicting eye gazes. However, the uncertainty for gaze estimation, e.g., input uncertainty and annotation uncertainty, have been neglected in previous research. Existing models use a deterministic function to estimate the gaze, which cannot reflect the actual situation in gaze estimation. To address this issue, we propose a probabilistic framework for gaze estimation by modeling the input uncertainty and annotation uncertainty. We first utilize probabilistic embeddings to model the input uncertainty, representing the input image as a Gaussian distribution in the embedding space. Based on the input uncertainty modeling, we give an instance-wise uncertainty estimation to measure the confidence of prediction results, which is critical in practical applications. Then, we propose a new label distribution learning method, probabilistic annotations, to model the annotation uncertainty, representing the raw hard labels as Gaussian distributions. In addition, we develop an Embedding Distribution Smoothing (EDS) module and a hard example mining method to improve the consistency between embedding distribution and label distribution. We conduct extensive experiments, demonstrating that the proposed approach achieves significant improvements over baseline and state-of-the-art methods on two widely used benchmark datasets, GazeCapture and MPIIFaceGaze, as well as our collected dataset using mobile devices.

10.

Impaired cerebral microvascular endothelial cells integrity due to elevated dopamine in myasthenic model.

Hao, Yue; Su, Yinchun; He, Yifan; Zhang, Wenyuan; Liu, Yang; Guo, Yu; Chen, Xingfan; Liu, Chunhan; Han, Siyu; Wang, Buyi; Liu, Yushuang; Zhao, Wei; Mu, Lili; Wang, Jinghua; Peng, Haisheng; Han, Junwei; Kong, Qingfei.

J Neuroinflammation ; 21(1): 10, 2024 Jan 04.

Artigo em Inglês | MEDLINE | ID: mdl-38178152

RESUMO

Myasthenia gravis is an autoimmune disease characterized by pathogenic antibodies that target structures of the neuromuscular junction. However, some patients also experience autonomic dysfunction, anxiety, depression, and other neurological symptoms, suggesting the complex nature of the neurological manifestations. With the aim of explaining the symptoms related to the central nervous system, we utilized a rat model to investigate the impact of dopamine signaling in the central nervous and peripheral circulation. We adopted several screening methods, including western blot, quantitative PCR, mass spectrum technique, immunohistochemistry, immunofluorescence staining, and flow cytometry. In this study, we observed increased and activated dopamine signaling in both the central nervous system and peripheral circulation of myasthenia gravis rats. Furthermore, changes in the expression of two key molecules, Claudin5 and CD31, in endothelial cells of the blood-brain barrier were also examined in these rats. We also confirmed that dopamine incubation reduced the expression of ZO1, Claudin5, and CD31 in endothelial cells by inhibiting the Wnt/ß-catenin signaling pathway. Overall, this study provides novel evidence suggesting that pathologically elevated dopamine in both the central nervous and peripheral circulation of myasthenia gravis rats impair brain-blood barrier integrity by inhibiting junction protein expression in brain microvascular endothelial cells through the Wnt/ß-catenin pathway.

Assuntos

Dopamina , Miastenia Gravis , Humanos , Ratos , Animais , Dopamina/metabolismo , Células Endoteliais/metabolismo , Encéfalo , Barreira Hematoencefálica/metabolismo , Via de Sinalização Wnt/fisiologia , Miastenia Gravis/metabolismo

11.

Frequency-specific functional difference between gyri and sulci in naturalistic paradigm fMRI.

Wang, Liting; Yang, Yang; Hu, Xintao; Zhao, Shijie; Jiang, Xi; Guo, Lei; Han, Junwei; Liu, Tianming.

Brain Struct Funct ; 229(2): 431-442, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-38193918

RESUMO

Disentangling functional difference between cortical folding patterns of gyri and sulci provides novel insights into the relationship between brain structure and function. Previous studies using resting-state functional magnetic resonance imaging (rsfMRI) have revealed that sulcal signals exhibit stronger high-frequency but weaker low-frequency components compared to gyral ones, suggesting that gyri may serve as functional integration centers while sulci are segregated local processing units. In this study, we utilize naturalistic paradigm fMRI (nfMRI) to explore the functional difference between gyri and sulci as it has proven to record stronger functional integrations compared to rsfMRI. We adopt a convolutional neural network (CNN) to classify gyral and sulcal fMRI signals in the whole brain (the global model) and within functional brain networks (the local models). The frequency-specific difference between gyri and sulci is then inferred from the power spectral density (PSD) profiles of the learned filters in the CNN model. Our experimental results show that nfMRI shows higher gyral-sulcal PSD contrast effect sizes in the global model compared to rsfMRI. In the local models, the effect sizes are either increased or decreased depending on frequency bands and functional complexity of the FBNs. This study highlights the advantages of nfMRI in depicting the functional difference between gyri and sulci, and provides novel insights into unraveling the relationship between brain structure and function.

Assuntos

Córtex Cerebral , Imageamento por Ressonância Magnética , Imageamento por Ressonância Magnética/métodos , Encéfalo/diagnóstico por imagem , Redes Neurais de Computação , Cabeça

12.

Identification of Genetic Risk Factors Based on Disease Progression Derived From Longitudinal Brain Imaging Phenotypes.

Du, Lei; Zhao, Ying; Zhang, Jianting; Shang, Muheng; Zhang, Jin; Han, Junwei.

IEEE Trans Med Imaging ; 43(3): 928-939, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-37847615

RESUMO

Neurodegenerative disorders usually happen stage-by-stage rather than overnight. Thus, cross-sectional brain imaging genetic methods could be insufficient to identify genetic risk factors. Repeatedly collecting imaging data over time appears to solve the problem. But most existing imaging genetic methods only use longitudinal imaging phenotypes straightforwardly, ignoring the disease progression trajectory which might be a more stable disease signature. In this paper, we propose a novel sparse multi-task mixed-effects longitudinal imaging genetic method (SMMLING). In our model, disease progression fitting and genetic risk factors identification are conducted jointly. Specifically, SMMLING models the disease progression using longitudinal imaging phenotypes, and then associates fitted disease progression with genetic variations. The baseline status and changing rate, i.e., the intercept and slope, of the progression trajectory thus shoulder the responsibility to discover loci of interest, which would have superior and stable performance. To facilitate the interpretation and stability, we employ l2,1 -norm and the fused group lasso (FGL) penalty to identify loci at both the individual level and group level. SMMLING can be solved by an efficient optimization algorithm which is guaranteed to converge to the global optimum. We evaluate SMMLING on synthetic data and real longitudinal neuroimaging genetic data. Both results show that, compared to existing longitudinal methods, SMMLING can not only decrease the modeling error but also identify more accurate and relevant genetic factors. Most risk loci reported by SMMLING are missed by comparison methods, implicating its superiority in genetic risk factors identification. Consequently, SMMLING could be a promising computational method for longitudinal imaging genetics.

Assuntos

Doença de Alzheimer , Humanos , Estudos Transversais , Doença de Alzheimer/genética , Neuroimagem/métodos , Encéfalo/diagnóstico por imagem , Fenótipo , Algoritmos , Progressão da Doença , Fatores de Risco , Imageamento por Ressonância Magnética/métodos

13.

Self-Reconstruction of Highly Degraded LiNi_0.8 Co_0.1 Mn_0.1 O₂ toward Stable Single-Crystalline Cathode.

Qin, Zuoyu; Zhang, Tao; Gao, Xuesong; Luo, Wuqing; Han, Junwei; Lu, Bingan; Zhou, Jiang; Chen, Gen.

Adv Mater ; 36(5): e2307091, 2024 Feb.

Artigo em Inglês | MEDLINE | ID: mdl-37857473

RESUMO

The ever-growing demand for resources sustainability has promoted the recycle of spent lithium-ion batteries to a strategic position. Direct recycle outperforms either hydrometallurgical or pyrometallurgical approaches due to the high added value and facile treatment processes. However, the traditional direct recycling technologies are only applicable for Ni-poor/middle cathodes. Herein, spent Ni-rich LiNi0.8 Co0.1 Mn0.1 O2 (S-NCM) to performance-enhanced single-crystalline cathode materials is directly recycled using a simple but effective LiOH-NaCl molten salt. The evolution process of the Li-supplement and grain-recrystallization during regeneration is systematically investigated, and the successful recovery of the highly degraded microstructure is comprehensively proven, including significant elimination of Ni2+ and O vacancies. Beneficial from the favorable reconstructed single-crystalline particles, the regenerated NCM (R-NCM) represents remarkably enhanced structural stability, electrochemical activity, O2 and cracks suppression during charge/discharge, thus achieving the excellent performances in long-term cycling and high-rate tests. As a result, R-NCM maintains the 86.5% reversible capacity at 1 C after 200 cycles. Instructively, the present molten salt can be successfully applied for recycling spent NCMs with various Li and Ni compositions (e.g., LiNi0.5 Co0.2 Mn0.3 O2 ).

14.

Species -Shared and -Unique Gyral Peaks on Human and Macaque Brains.

Zhang, Songyao; Zhang, Tuo; Cao, Guannan; Zhou, Jingchao; He, Zhibin; Li, Xiao; Ren, Yudan; Liu, Tao; Jiang, Xi; Guo, Lei; Han, Junwei; Liu, Tianming.

bioRxiv ; 2024 Feb 05.

Artigo em Inglês | MEDLINE | ID: mdl-37546923

RESUMO

Cortical folding is an important feature of primate brains that plays a crucial role in various cognitive and behavioral processes. Extensive research has revealed both similarities and differences in folding morphology and brain function among primates including macaque and human. The folding morphology is the basis of brain function, making cross-species studies on folding morphology important for understanding brain function and species evolution. However, prior studies on cross-species folding morphology mainly focused on partial regions of the cortex instead of the entire brain. Previously, we defined a whole-brain landmark based on folding morphology: the gyral peak. It was found to exist stably across individuals and ages in both human and macaque brains. In this study, we identified shared and unique gyral peaks in human and macaque, and investigated the similarities and differences in the spatial distribution, anatomical morphology, and functional connectivity of them.

15.

Position-based anchor optimization for point supervised dense nuclei detection.

Yao, Jieru; Han, Longfei; Guo, Guangyu; Zheng, Zhaohui; Cong, Runmin; Huang, Xiankai; Ding, Jin; Yang, Kaihui; Zhang, Dingwen; Han, Junwei.

Neural Netw ; 171: 159-170, 2024 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-38091760

RESUMO

Nuclei detection is one of the most fundamental and challenging problems in histopathological image analysis, which can localize nuclei to provide effective computer-aided cancer diagnosis, treatment decision, and prognosis. The fully-supervised nuclei detector requires a large number of nuclei annotations on high-resolution digital images, which is time-consuming and needs human annotators with professional knowledge. In recent years, weakly-supervised learning has attracted significant attention in reducing the labeling burden. However, detecting dense nuclei of complex crowded distribution and diverse appearances remains a challenge. To solve this problem, we propose a novel point-supervised dense nuclei detection framework that introduces position-based anchor optimization to complete morphology-based pseudo-label supervision. Specifically, we first generate cellular-level pseudo labels (CPL) for the detection head via a morphology-based mechanism, which can help to build a baseline point-supervised detection network. Then, considering the crowded distribution of the dense nuclei, we propose a mechanism called Position-based Anchor-quality Estimation (PAE), which utilizes the positional deviation between an anchor and its corresponding point label to suppress low-quality detections far from each nucleus. Finally, to better handle the diverse appearances of nuclei, an Adaptive Anchor Selector (AAS) operation is proposed to automatically select positive and negative anchors according to morphological and positional statistical characteristics of nuclei. We conduct comprehensive experiments on two widely used benchmarks, MO and Lizard, using ResNet50 and PVTv2 as backbones. The results demonstrate that the proposed approach has superior capacity compared with other state-of-the-art methods. In particularly, in dense nuclei scenarios, our method can achieve 95.1% performance of the fully-supervised approach. The code is available at https://github.com/NucleiDet/DenseNucleiDet.

Assuntos

Benchmarking , Diagnóstico por Computador , Humanos , Processamento de Imagem Assistida por Computador , Conhecimento , Aprendizado de Máquina Supervisionado

16.

Weakly Supervised Semantic Segmentation via Alternate Self-Dual Teaching.

Zhang, Dingwen; Li, Hao; Zeng, Wenyuan; Fang, Chaowei; Cheng, Lechao; Cheng, Ming-Ming; Han, Junwei.

IEEE Trans Image Process ; PP2023 Dec 20.

Artigo em Inglês | MEDLINE | ID: mdl-38117621

RESUMO

Weakly supervised semantic segmentation (WSSS) is a challenging yet important research field in vision community. In WSSS, the key problem is to generate high-quality pseudo segmentation masks (PSMs). Existing approaches mainly depend on the discriminative object part to generate PSMs, which would inevitably miss object parts or involve surrounding image background, as the learning process is unaware of the full object structure. In fact, both the discriminative object part and the full object structure are critical for deriving of high-quality PSMs. To fully explore these two information cues, we build a novel end-to-end learning framework, alternate self-dual teaching (ASDT), based on a dual-teacher single-student network architecture. The information interaction among different network branches is formulated in the form of knowledge distillation (KD). Unlike the conventional KD, the knowledge of the two teacher models would inevitably be noisy under weak supervision. Inspired by the Pulse Width (PW) modulation, we introduce a PW wave-like selection signal to alleviate the influence of the imperfect knowledge from either teacher model on the KD process. Comprehensive experiments on the PASCAL VOC 2012 and COCO-Stuff 10K demonstrate the effectiveness of the proposed ASDT framework, and new state-of-the-art results are achieved.

17.

NCSiam: Reliable Matching via Neighborhood Consensus for Siamese-Based Object Tracking.

Lai, Pujian; Cheng, Gong; Zhang, Meili; Ning, Jifeng; Zheng, Xiangtao; Han, Junwei.

IEEE Trans Image Process ; 32: 6168-6182, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37938957

RESUMO

An essential need for accurate visual object tracking is to capture better correlations between the tracking target and the search region. However, the dominant Siamese-based trackers are limited to producing dense similarity maps at once via a cross-correlations operation, ignoring to remedy the contamination caused by erroneous or ambiguous matches. In this paper, we propose a novel tracker, termed neighborhood consensus constraint-based siamese tracker (NCSiam), which takes the idea of neighborhood consensus constraint to refine the produced correlation maps. The intuition behind our approach is that we can support the nearby erroneous or ambiguous matches by analyzing a larger context of the scene that contains a unique match. Specifically, we devise a 4D convolution-based multi-level similarity refinement (MLSR) strategy. Taking the primary similarity maps obtained from a cross-correlation as input, MLSR acquires reliable matches by analyzing neighborhood consensus patterns in 4D space, thus enhancing the discriminability between the tracking target and the distractors. Besides, traditional Siamese-based trackers directly perform classification and regression on similarity response maps which discard appearance or semantic information. Therefore, an appearance affinity decoder (AAD) is developed to take full advantage of the semantic information of the search region. To further improve performance, we design a task-specific disentanglement (TSD) module to decouple the learned representations into classification-specific and regression-specific embeddings. Extensive experiments are conducted on six challenging benchmarks, including GOT-10k, TrackingNet, LaSOT, UAV123, OTB2015, and VOT2020. The results demonstrate the effectiveness of our method. The code will be available at https://github.com/laybebe/NCSiam.

18.

Mutual-Assistance Learning for Object Detection.

Xie, Xingxing; Lang, Chunbo; Miao, Shicheng; Cheng, Gong; Li, Ke; Han, Junwei.

IEEE Trans Pattern Anal Mach Intell ; 45(12): 15171-15184, 2023 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-37756169

RESUMO

Object detection is a fundamental yet challenging task in computer vision. Despite the great strides made over recent years, modern detectors may still produce unsatisfactory performance due to certain factors, such as non-universal object features and single regression manner. In this paper, we draw on the idea of mutual-assistance (MA) learning and accordingly propose a robust one-stage detector, referred as MADet, to address these weaknesses. First, the spirit of MA is manifested in the head design of the detector. Decoupled classification and regression features are reintegrated to provide shared offsets, avoiding inconsistency between feature-prediction pairs induced by zero or erroneous offsets. Second, the spirit of MA is captured in the optimization paradigm of the detector. Both anchor-based and anchor-free regression fashions are utilized jointly to boost the capability to retrieve objects with various characteristics, especially for large aspect ratios, occlusion from similar-sized objects, etc. Furthermore, we meticulously devise a quality assessment mechanism to facilitate adaptive sample selection and loss term reweighting. Extensive experiments on standard benchmarks verify the effectiveness of our approach. On MS-COCO, MADet achieves 42.5% AP with vanilla ResNet50 backbone, dramatically surpassing multiple strong baselines and setting a new state of the art.

19.

PathwayTMB: A pathway-based tumor mutational burden analysis method for predicting the clinical outcome of cancer immunotherapy.

Li, Xiangmei; He, Yalan; Jiang, Ying; Pan, Bingyue; Wu, Jiashuo; Zhao, Xilong; Huang, Junling; Wang, Qian; Cheng, Liang; Han, Junwei.

Mol Ther Nucleic Acids ; 34: 102026, 2023 Dec 12.

Artigo em Inglês | MEDLINE | ID: mdl-37744173

RESUMO

Immunotherapy has become one of the most promising therapy methods for cancer, but only a small number of patients are responsive to it, indicating that more effective biomarkers are urgently needed. This study developed a pathway analysis method, named PathwayTMB, to identify genomic mutation pathways that serve as potential biomarkers for predicting the clinical outcome of immunotherapy. PathwayTMB first calculates the patient-specific pathway-based tumor mutational burden (PTMB) to reflect the cumulative extent of mutations for each pathway. It then screens mutated survival benefit-related pathways to construct an immune-related prognostic signature based on PTMB (IPSP). In a melanoma training set, IPSP-high patients presented a longer overall survival and a higher response rate than IPSP-low patients. Moreover, the IPSP showed a superior predictive effect compared with TMB. In addition, the prognostic and predictive value of the IPSP was consistently validated in two independent validation sets. Finally, in a multi-cancer dataset, PathwayTMB also exhibited good performance. Our results indicate that PathwayTMB could identify the mutation pathways for predicting immunotherapeutic survival, and their combination may serve as a potential predictive biomarker for immune checkpoint inhibitor therapy.

20.

Retain and Recover: Delving Into Information Loss for Few-Shot Segmentation.

Lang, Chunbo; Cheng, Gong; Tu, Binfei; Li, Chao; Han, Junwei.

IEEE Trans Image Process ; 32: 5353-5365, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-37729571

RESUMO

Benefiting from advances in few-shot learning techniques, their application to dense prediction tasks (e.g., segmentation) has also made great strides in the past few years. However, most existing few-shot segmentation (FSS) approaches follow a similar pipeline to that of few-shot classification, where some core components are directly exploited regardless of various properties between tasks. We note that such an ill-conceived framework introduces unnecessary information loss, which is clearly unacceptable given the already very limited training sample. To this end, we delve into the typical types of information loss and provide a reasonably effective way, namely Retain And REcover (RARE). The main focus of this paper can be summarized as follows: (i) the loss of spatial information due to global pooling; (ii) the loss of boundary information due to mask interpolation; (iii) the degradation of representational power due to sample averaging. Accordingly, we propose a series of strategies to retain/recover the avoidable/unavoidable information, such as unidirectional pooling, error-prone region focusing, and adaptive integration. Extensive experiments on two popular benchmarks (i.e., PASCAL- 5i and COCO- 20i ) demonstrate the effectiveness of our scheme, which is not restricted to a particular baseline approach. The ultimate goal of our work is to address different information loss problems within a unified framework, and it also exhibits superior performance compared to other methods with similar motivations. The source code will be made available at https://github.com/chunbolang/RARE.

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

RESUMO

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA